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Abstract — Cellular systems that employ time division duplexing 
(TDD) transmission are good candidates for implementation 
of interference alignment (IA) in the downlink since channel 
reciprocity enables the estimation of the channel state by the 
base stations (BS) in the uplink phase. However, the interfering 
BSs need to share their channel estimates via backhaul links of 
finite capacity. A quantization scheme is proposed which reduces 
the amount of information exchange (compared to conventional 
methods) required to achieve IA in such a system. The scaling 
(with the transmit power) of the number of bits to be exchanged 
between the BSs that is sufficient to preserve the multiplexing 
gain of IA is derived. Q 

I. Introduction 

Interference alignment (IA) is known to achieve the optimal 
degree of freedom (DoF) in interference channel (IC). This 
implies that at high signal to noise ratio (SNR) regime, IA 
improves the system throughput compared to the conventional 
orthogonal medium-sharing methods. However, implementa- 
tion of IA in existing systems faces a lot of challenges. The 
necessity of channel state information (CSI) at the transmitters 
is one of the major issues which is not practical in many 
situations. Moreover, the accuracy of the CSI provided to the 
transmitters should increase as the power increases in order 
to guarantee the DoF gains promised by IA [1|. Therefore 
transmission systems which acquire the CSI through feedback 
(such as frequency division duplex (FDD) systems) become 
less favorable for implementation of IA since the potential 
gains only appear at high powers. 

For time division duplex (TDD) systems, every base station 
can estimate its downlink channels from the uplink transmis- 
sion phase thanks to reciprocity. However, this local CSI is not 
sufficient, and the BSs need to share their channel estimates 
which can be carried out through backhaul links between BSs. 
These backhaul links generally have limited capacity, which 
should be exploited efficiently. 

In scenarios where the receivers quantize and feed the 
CSI back to the transmitters, the problem is explored over 
frequency selective channels for single-antenna users in |fl] 
and for multiple-antenna users in [2 |. Both references provide 
DoF-achieving quantization schemes and establish the required 
scaling of the number of feedback bits. For alignment using 
spatial dimensions, 1 3 ] provides the scaling of feedback bits 
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to achieve I A in MIMO IC. For the broadcast channel, the 
scaling of the feedback bits was characterized in |0J. In 0, 
quantization of the precoding matrix using random vector 
quantization (RVQ) codebooks is investigated which provides 
insights on the asymptotic optimality of RVQ. From another 
point of view, |6| provides an analysis of the effect of 
imperfect CSI on the mutual information of the interference 
alignment scheme. 

In this paper, we focus on the scenario where the BSs have 
perfect but local CSI, and must share it to achieve IA. A 
CSIT sharing scheme is proposed which reduces the amount 
of information exchange required for interference alignment 
in such a system. The scaling (with the transmit power) of the 
number of bits to be transferred which is sufficient to preserve 
the multiplexing gain that can be achieved using perfect 
CSI is derived. Moreover, a heuristic method is proposed to 
demonstrate the achievability of the DoF by simulations. 

Notation: Boldface lowercase and uppercase letters indicate 
vectors and matrices, respectively. I at is the N x N identity 
matrix. The trace, conjugate, Hermitian transpose of a matrix 
or vector are denoted by tr(-), (•)*, (-) H respectively. The 
expectation operator is represented by E(-). The Frobenius 
norm and the determinant of a matrix are denoted by || • ||p 
and | ■ | respectively. The maximum eigenvalue of a matrix 
is represented by A max (-). A diagonal (resp. block diagonal) 
matrix is denoted by diag(-) (resp. Bdiag(-)) with the argu- 
ment elements (resp. blocks) on its diagonal. Af(0,l) (resp. 
CJ\f(Q, 1)) denotes the real (resp. circularly symmetric com- 
plex) Gaussian distribution with zero mean and unit variance. 

II. System Model 

An interference channel is considered in which K base 
stations (BS) and K users (one user in each cell) are consid- 
ered as transmitters and receivers, respectively. For the sake of 
simplicity of the exposition, we focus on the symmetric case, 
and assume that each BS has M antennas while each user is 
equipped with N antennas. These results trivially generalize to 
non-homogeneous antenna numbers and per-user DoF as long 
as IA is feasible for the chosen problem dimensions. Each BS 
employs a linear precoder to transmit d data streams to its 
user. The received signal at user i is denoted by 

K 
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in which ELy 6 ^JvxAf j s tne channel matrix between 
BS j and user i, Vj £ c Mxd and Xj e C d are the 
precoding matrix and the data vector of BS j, respectively. 
Furthermore, rij is the additive noise at user i whose 
entries are distributed according to CAf(0, 1). Assuming 
E (xj-xj 1 ) = -j Id, j = 1> • • • , K and using truncated unitary 
precoders, the transmit power for each BS is equal to P. We 
further assume that the elements of the data symbol are i.i.d. 
Gaussian random variables. The channels are assumed to be 
generic 0; in particular, this includes channels with entries 
drawn independently from a continuous distribution. 

III. CSIT SHARING FOR IA 

Let us consider TDD transmission, which enables the BSs 
to estimate their channels toward different users by exploiting 
the reciprocity of the wireless channel. Specifically, we assume 
that the jth BS estimates the channel matrices ELy, i = 
l,...,K,i 7^ j (denoted by local CSI) from the uplink 
phase, via reciprocity. We first assume that local CSI is known 
perfectly at BS j. However, global CSI (excluding the direct 
channels H,j) is required in order to design IA precoders. 
In this section we consider the topology of CSI exchange in 
the network, and work under the assumption that perfect local 
CSI is conveyed from each BS to a processing node which 
computes all precoders and provides them to the BSs. 

Here we assume a feasible IA setting [8], i.e. there exist 
precoding matrices Vj, j = 1,...,K and projection matrices 
U; <= C Nxd , i = 1, K such that 

U,"IT„V, = Vi, j e {1, ...,K}, j £ i, (2) 
raak(Uf H»Vi) = d. (3) 

Condition §2^ can be rewritten as 

l! l; ; II.V\, Vj E{1,...,K}, (4) 

in which U_j = Bdiag(Ui, . . . , Uj_i, Uj+i, . . . , Ur-) and 
TT> = [HH.,... 1 HH_ 1)j ,H^ lij ,...,HH/ is a (K-l)Nx 
M matrix. 

We will further assume that (K — 1)N > M, which 
represents the cases where transmitter-side zero-forcing is 
not enough to eliminate all interference, and therefore IA is 
required. The following lemma highlights the intuition behind 
our CSI sharing scheme. 

Lemma 1: In order to design IA precoders, it is sufficient 
that each BS j sends a point on the Grassmann manifold 
G(k-i)n,m representing the column space of EL, to the IA 
processing node. 

Proof: Let Fj denote a (K — 1)N X M matrix con- 
taining an orthonormal basis of the column space of Hj, 
i.e. Hj — FjCj for some Cj (invertible almost surely for 
generic channels). According to our assumption that only the 
column space of EL is known at the central unit, we can 
assume that the central unit has only access to a rotated 
version of Fj, i.e., FjOj for some unknown unitary matrix 
Oj. We now show that alignment can be achieved based on 



the knowledge of FjOj rather than of EL,. Let us assume 
that the processing node designs a set ({Ujj-JLii V^j}f = \) 
of IA transmit precoders and receive projection filters for the 
channels {FjOj}jL r Then, 

U*j(FjOj)Vj = => XjKjFjCjCj'OjVj = (5) 
=* U ,; ,II,C, O V, = 0. (6) 

This indicates that IA is achieved over the real channel by 
using Cj 1 OjVj as precoder and Uj as the projection filter 
at user j. Assuming that Vj is transmitted from the processing 
node back to BS j, and that Oj is known at BS j since the 
reconstruction codebook of the processing node is known, the 
BS is in a position to compute the precoder Cj OjVj. ■ 
Note that the feedback of Vj from the processing node to 
BS j also takes the form of a point on Qma, and will be 
analyzed in further detail in the sequel. 

IV. CSIT SHARING OVER FINITE CAPACITY LINKS 

In this section, using the Grassmannian representation out- 
lined in the previous section, we explore several scenarios 
where CSI is quantized and exchanged between the nodes over 
finite capacity links. Three different scenarios regarding the 
CSIT sharing problem can be considered: 

I. The IA processing node is a separate central node that 
computes and distributes the IA precoders to the K BSs, 

II. One BS also acts as the IA processing node, 

III. Each BS receives all the required CSI and independently 
computes the I A precoders. 

In scenario I (Fig. [Tf a )X the CSI (in the form of Fj) is 
quantized yielding Fj and sent to the central node. The central 
node computes the precoders and provides BS j a quantized 
version Vj of Vj. Here we assume that each BS uses Nf, 
bits to quantize Fj and the central node uses N c bits to 
quantize Vj. Therefore, the total number of bits exchanged 
over the network for scenario I is equal to K(Nb + N c ). 
Scenario II can be considered as a particular example of 
scenario I where one (bi-directional) BS-central node link is 
saved; the number of bits to be transferred in the network is 
(K - l)(N b + N c ). In scenario III (Fig. [2b)), the IA solution 
is computed independently at each BS, requiring global CSI 
at each BS. Therefore each BS needs to quantize and send its 
local CSI to all other K — 1 BSs. The precoders are designed 
at the BSs and no further information exchange is required. 
For simplicity of the exposition, we focus on scenario I and 
characterize the scaling of N\, and N c with P, noting that 
a generalization of the analysis to scenarios II and III is 
straightforward. 

Let us first consider the feedback from a BS to the central 
node. BS j performs the QR decomposition Hj = FjCj 
and quantizes the subspace spanned by the columns of Fj 
using Nf, bits and sends the index of the quantized codeword 
to the central node. We further assume that the BSs and the 
central node share a predefined codebook S = {Si, Sgw,,} 
which is composed of 2 Nb truncated unitary matrices of size 
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Fig. 1. CSIT sharing, with (a) and without (b) central node. 



(K — 1)N x M and is designed using Grassmannian subspace 
packing. For simplicity, let us assume that all K codebooks 
have the same size and the powers of the transmitted signals 
and receiver noise are symmetric across the network. The 
quantized codeword is the closest point in S w.r.t. the chordal 
distance, i.e., 



?i = argmin d c (S,Fo) 
3 6 ses y 3 ' 



in which 4(X,Y) 
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is the chordal 
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distance between two points in G(k-i)n.m represented by 
truncated unitary matrices X and Y J9). The interference 
alignment problem is then solved at the central node based 
on {F,}f =1 to find ({U,}f =1 , {V,-}f =1 ) fulfilling 



U^F/V- J= 0, Vj €{1,. ..,#}• 



(8) 



We now consider the feedback of Vj from the central node 
to BS j. Using another codebook T = {Ti, T 2 w c } of 
truncated unitary matrices representing points in Gu,d< me 
central node quantizes the alignment precoder V 3 for each 
BS on Qm,<i according to 
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(9) 



and sends the corresponding index to BS j. At BS j, we define 
the total precoder as V 3 = C^F^FjVj, by analogy to the 
perfect CSI case (Lemma[T|i. Using the precoders V 3 and after 
applying the receive filter Uj to (|T), the interference leakage 
(due to the quantizations (|7]i and (0) at user i is defined as 



E 

l<j<K 



(10) 



We denote the leakage power at user i by Li = tr(E(e,;ef )) = 
tr(f Q|), where Q} 



E.tl^iUfHyV.-VfHHU, 



We 



now consider the sum over all users of the leakage powers: 
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Substituting V 3 



C-^fF.V, and H, 



Fj-Cj gives 



lU^FfF^-l 



(12) 



From ® we have V^FjVjVfVj = 0, therefore by some 
manipulations, from ( [Til l. ( TT2l we get 



E 



X" 



| F <E 



where 



F i F-)F i V j 



and 



X^ = U^F 



.(v.vf-v.vf)^ 



2 (13) 



(14) 



Using the fact that all the matrices involved in Xj and 
X 1 ^ are truncated unitary, it can be shown that ||X^||f < 



2dd c (F i ,F i ) and ||X||| F < V2dd c {Vj, Vj). Using 
bounds on the quantization error for codebooks designed by 
sphere packing, it can be shown |3| that L in dl"3T > is upper 
bounded by a constant cq independent of P when 

iV 6 = ^logP and N c =^logP, (15) 

in which G b = 2M((K - 1)N - M) and G c = 2d(M - d) 
are the real dimension of G(k-i)n,m an d Gm,<i respectively. 
Under the conditions (1151 . it is clear that the leakage power at 
every receiver would be bounded by a constant since Li < L. 

In order to establish the DoF achievable using the proposed 
CSI quantization scheme, we provide a lower bound for the 
achievable rate. First consider the following lemma: 

Lemma 2: For Nf, and N c according to (TT3T > we have, 



lim 

P^oo 



log I<j 



logP 



= d, 



(16) 



with Qi; = UfHiiViVfHgUi, almost surely. 

Proof: Note that the limit in ( TTol ) involves codebooks of 
increasing size since Nf, and N c increase with P. Qg does not 
necessarily admit a limit when F->oo due to the fact that U, 
and V; are functions of the codebook. We tackle this problem 
by resorting to an argument based on the compactness of the 
solution space, and show that there exists a series of codebooks 
of increasing size for which Q s admits a limit and is full rank 
a.s. The full proof is similar to the proof of Theorem 2 in [ 10 1 , 
and is omitted due to space constraints. ■ 

We are now in the position of proving that the proposed 
method achieves the full IA DoF: 

Theorem 1: The proposed quantization scheme, with Nb 
and N c according to (1151 1. achieves the same DoF as IA under 
perfect CSI. 
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Proof: Recall that ( fl~5b ensures that Li < cq. Therefore, 
x (f Q\) < tr (f Q|) = U < c , which yields 



Id + ^Ql 



< d\og 1 + A 



7 Q ' 



< dlog(l+c ). 
(17) 

Hence, the achievable rate using the designed precoders and 
receive filters can be lower-bounded as follows, 
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(18) 
(19) 
(20) 



where ([l9l i follows from the fact that QJ is positive semi- 
definite and the second inequality follows from (TTvT t- Com- 
bining (f20b with Lemma [2] brings us to the conclusion that 

K 



lim 



P->oc i og p 



> d, i.e. the full DoF is achieved. 
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Fig. 2. Sum-rate comparison of quantization methods, for the 
3-user MIMO IC, M = 5, N = 3, d = 2. 



V. Simulation Results and discussion 

A. Performance results using RVQ 

In this section, the performance of the proposed scheme 
is evaluated through numerical simulations. The performance 
metric is the sum-rate evaluated through Monte-Carlo simu- 
lations employing truncated unitary precoders. A three-user 
IC is considered where each BS is equipped with M = 5 
antennas while every receiver has N — 3 antennas and 
d = 2 data streams for each user is considered. Entries of the 
channel matrices are generated according to CAf(0, 1) and the 
performance results are averaged over the channel realizations. 
In in Fig. [2] the quantized CSI feedback method of Section ITVl 
(denoted by "Proposed") is compared (for scenario I) to the 
naive method where the interfering channel matrices from the 
BSs are independently vectorized, normalized and quantized 
using Nf, bits based on the idea of composite Grassmann 
manifold [2] and finally the indices of the quantized vec- 
tors are sent to the central node (denoted by Normalized 
Channel Composite Grassmann Quantization, NC-CGQ). At 
the central node, in both cases, each precoder is vectorized, 
normalized and quantized on GMd.i using N c bits, and sent to 
the corresponding BS. Figure [2] shows the achievable sum- 
rate versus transmit SNR (P) for (N b ,N c ) = (5,6) and 
(Nj,, N c ) — (10, 12) bits. A random codebook is used with 
codebook entries chosen as independent truncated unitary 
matrices generated from the Haar distribution. For the indepen- 
dent quantization method, random unit norm vectors are used 
in the codebook construction. Clearly the proposed scheme 
outperforms the independent quantization method for the same 
number of bits. 

B. Perturbations on the Grassmann manifold 

The use of random codebooks for large values of Nb and 
N c is not tractable, due to the exponential requirements in 



terms of storage and of computation of and (|9). In order to 
benchmark the sum-rate achievable under the proposed scheme 
for the high power region (large Nb and N c ) we replace the 
quantization process with a perturbation which approximates 
the quantization error. As will be seen, this approach provides 
a good approximation of the effect of quantization on the 
considered system. We now detail the proposed perturbation 
technique. 

Let us consider a point on Q n . p , represented by a n x p 
truncated unitary matrix F. Here, we assume that n > 2p, 
since it is otherwise more efficient to consider the left null 
space of F instead. Since the columns of F are orthonormal, 
they can be completed to form an orthonormal basis of the Tri- 
dimensional space. In fact, according to 1111 . any other point 
on Qn ;P can be represented in the basis constituted by the 
columns of the unitary matrix W = [F F c ] as 

C 

F = W S , (21) 

_0„_2p_ 

for some F c in the left null space of F and 

C = diag(cosf?i, • • • , cos(9 p ), S = diag(sinf3 1 , • • • , sinf9 p ), 

where 8\,...,9 P are real angles. The squared chordal distance 
between the two points on Q n<p represented by F and F is 
r = g?;?(F, F) = 2i=i sm2 Therefore, in order to generate 
random perturbations of a certain chordal distance ^/¥q from 
F, we propose to generate random values for the angles 
01, 9 P such that j sin 2 6i — ro, and to pick a random 
orthonormal basis F c of the left null subspace of F. The 
perturbed matrix is then computed using (l2TY The histogram 
(not shown) of the squared quantization error obtained from 
an RVQ implementation suggests that the Gaussian distri- 
bution is a good approximation for the probability density 
function of r. In fact, the moments of this distribution can 



be obtained using the following result from ||9] Theorem 6]: 
for asymptotically large codebook size, when using a random 
codebook C of size J for quantizing a matrix F arbitrarily 
distributed over an arbitrary manifold, the fc-th moment of the 
quantization error distribution, — Ec,f(^c (F, F)) for 

F = argmincee d c (F, C), can be bounded as 



G 



(G + k)(cjy l 



< D {k) < 



r(&) 
Hcjy 



where c and G are respectively the coefficient of the ball 
volume and the real dimension of the corresponding manifold 
(here, the Grassmann manifold). Note that (l22t only provides 
bounds on D^ k \ however since both the upper and lower 
bounds are asymptotically tight when the codebook size in- 
creases, we arbitrarily choose to use the upper bound as an 
approximation of D^ k \ i.e. 



m 

f(cJ)i 



D (2) 



is the average and 
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(23) 



(24) 



is the variance. We propose generate the values for r according 
to Af(f,a 2 ) truncated to M + . This process is summarized in 
Algorithm [T] 

Algorithm 1 Generating random perturbations around F 

• Draw a random realization of the squared chordal dis- 
tance r from Af(f, a 2 ) 

• If r < then generate a new sample 



Generate independent s\, . . 
the interval [0, 1] 

Compute the angles 6i = sL* , — 

Generate a random orthonormal basis F c of the left null 
space of F and compute F according to ( f2Tb . 



This algorithm was used to simulate the effect of quantiza- 
tion taking place at BSs as well as the central node. The sum- 
rate performance J2iLi Rq obtained using the perturbation 
method is plotted against SNR for various codebook sizes in 
Fig. |3] For the considered antenna configuration, according to 
( fT3T l, the scaling that is sufficient to achieve the perfect DoF 
is Nf, — 5 log P and N c = 6 log P. In the simulations, the 
codebook sizes are chosen as Nf, = 5A and N c = QA for 
integer values of A, and the corresponding SNR is computed 
according to P = 2 A . The results are also compared to 
perfect CSIT sharing. It is clear that this perturbation method 
effectively approximates the quantization process when the 
desired performance metric is the sum-rate, allowing us to 
rely on the curves resulting from this method to confirm the 
DoF result of Theorem Q] 
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Fig. 3. Sum rate comparison between perturbation method 
(dashed) and quantization (solid), for the 3-user MIMO IC, 
M = 5, N = 3 and d = 2. 



VI. Conclusion 

A limited feedback scheme was proposed for efficient shar- 
ing of CSIT among interfering BSs in downlink interference 
alignment for TDD cellular systems. The growth rate of the 
bits to be transferred with respect to the transmit power was 
characterized in order to preserve the total multiplexing gain 
and a heuristic method was proposed to verify the achievability 
of multiplexing gain by simulation. 
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